Corpus: hat_wikipedia_2021

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 2678 1-
2 1706 M-
3 1665 p-
4 1543 A-
5 1537 a-
Top Character Bigrams
word rank frequency n-gram
1 795 ko-
2 778 Ma-
3 592 re-
4 476 de-
5 469 1,-
Top Character Trigrams
word rank frequency n-gram
1 471 kon-
2 264 Mar-
3 174 pwo-
4 170 Kon-
5 154 Cha-
Top Character 4-Grams
word rank frequency n-gram
1 124 Jean-
2 109 Mari-
3 104 kont-
4 102 kons-
5 83 konp-
Top Character 5-Grams
word rank frequency n-gram
1 78 Jean--
2 71 Marie-
3 54 Saint-
4 44 kontr-
5 42 trans-
1288 msec needed at 2024-10-24 14:00